ON THE LOWER TAIL VARIATIONAL PROBLEM FOR RANDOM GRAPHS


YUFEI ZHAO 


Abstract. We study the lower tail large deviation problem for subgraph counts in a random graph. Let $X_H$ denote the number of copies of $H$ in an Erdős-Rényi random graph $\mathcal{G}(n,p)$. We are interested in estimating the lower tail probability $\mathbb{P}(X_H \le (1-\delta)\mathbb{E}X_H)$ for fixed $0 < \delta < 1$.

Thanks to the results of Chatterjee, Dembo, and Varadhan, this large deviation problem has been reduced to a natural variational problem over graphons, at least for $p \ge n^{-\alpha_H}$ (and conjecturally for a larger range of $p$). We study this variational problem and provide a partial characterization of the so-called "replica symmetric" phase. Informally, our main result says that for every $H$, and $0 < \delta < \delta_H$ for some $\delta_H > 0$, as $p \to 0$ slowly, the main contribution to the lower tail probability comes from Erdős-Rényi random graphs with a uniformly tilted edge density. On the other hand, this is false for non-bipartite $H$ and $\delta$ close to 1.


1. Background 


We consider large deviations of subgraph counts in Erdős-Rényi random graphs. Fix a graph $H$, and let $X_H$ denote the number of copies of $H$ in an Erdős-Rényi random graph $\mathcal{G}(n,p)$. For a fixed $\delta > 0$, the problem is to estimate the probabilities

(upper tail) $\mathbb{P}(X_H \ge (1+\delta)\mathbb{E}X_H)$ and
(lower tail) $\mathbb{P}(X_H \le (1-\delta)\mathbb{E}X_H)$.

This problem has a long history (see [6] and its references). For the order of the logarithm of the tail probability, the upper tail problem is considered more difficult and it was resolved only fairly recently [6, 12]. We are now interested in the finer question of determining the large deviation rate, or equivalently the first order asymptotics of the logarithm of the tail probability.

Chatterjee and Varadhan [10] (the dense setting, with $p$ constant) and more recently Chatterjee and Dembo [7] (the sparse setting, with $p \to 0$ and $p \ge n^{-\alpha_H}$ for some $\alpha_H > 0$) showed that this large deviation problem reduces to a natural variational problem in the space of graphons, which are a certain type of graph limit. We begin by reviewing this connection, and then we shift our attention to analyzing the variational problem.

The language of graph limits is used throughout our discussion, so let us review some terminology. We refer the reader to the beautifully written monograph by Lovász [24] or the original sources, e.g., [25, 26], for more on the subject. A graphon is a symmetric measurable function $W\colon [0,1]^2 \to [0,1]$ (here symmetric means $W(x,y) = W(y,x)$). We write $V(H)$ and $E(H)$ for the vertex and edge sets of a graph $H$, respectively, and $v(H) = |V(H)|$ and $e(H) = |E(H)|$ for their cardinalities. For any graphs $H$ and $G$, we write $\hom(H,G)$ for the number of graph homomorphisms from $H$ to $G$. We denote by $t(H,G) := \hom(H,G)/v(G)^{v(H)}$ the $H$-density in $G$. The $H$-density of a graphon $W$ is defined by (here $W$ could be $\mathbb{R}$-valued):

$$t(H,W) := \int_{[0,1]^{V(H)}} \prod_{ij \in E(H)} W(x_i,x_j) \prod_{i \in V(H)} dx_i.$$

As usual, $K_t$ denotes the complete graph on $t$ vertices. As an example, we have

$$t(K_3,W) = \int_{[0,1]^3} W(x,y)W(x,z)W(y,z)\,dx\,dy\,dz.$$

The notion of cut distance is mentioned a few times in this paper, but it is not used in a substantial way, so we refer the reader to [24, Chapter 8] for details.
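To make the definition of $t(H,W)$ concrete, here is a minimal sketch that evaluates it for step-function graphons, i.e., graphons that are constant on each block of a partition of $[0,1]$ into $k$ equal intervals. The function name `hom_density` and the matrix encoding of the graphon are illustrative assumptions, not notation from this paper.

```python
from itertools import product

def hom_density(edges, num_vertices, A):
    """t(H, W) for a step graphon W given by a symmetric k x k matrix A.

    W equals A[i][j] on the block I_i x I_j, where I_1, ..., I_k are
    equal-length intervals partitioning [0, 1]. H has vertices
    0, ..., num_vertices - 1 and the given edge list.
    """
    k = len(A)
    total = 0.0
    # Each vertex of H lands in one of the k blocks; every assignment
    # of vertices to blocks has measure k ** (-num_vertices).
    for blocks in product(range(k), repeat=num_vertices):
        term = 1.0
        for i, j in edges:
            term *= A[blocks[i]][blocks[j]]
        total += term
    return total / k ** num_vertices

TRIANGLE = [(0, 1), (0, 2), (1, 2)]

# The constant graphon W = q has t(K_3, W) = q^3.
q = 0.3
t_const = hom_density(TRIANGLE, 3, [[q]])

# A two-block graphon with value a on the diagonal blocks and b elsewhere.
a, b = 0.2, 0.5
t_two_block = hom_density(TRIANGLE, 3, [[a, b], [b, a]])
```

The enumeration costs $k^{v(H)}$ terms, so it is only meant for small examples; the two-block case reproduces the closed form $\frac14 a^3 + \frac34 ab^2$ that appears later in (4.6).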

We write

$$I_p(x) := x\log\frac{x}{p} + (1-x)\log\frac{1-x}{1-p}$$

for the relative entropy function. For any function $f$, we write $\mathbb{E}[f(W)] := \int_{[0,1]^2} f(W(x,y))\,dx\,dy$.
We begin with a review of what is known for upper tails. In the dense case, for fixed $0 < p \le q < 1$, it was shown in [10] that as $n \to \infty$,

$$\log \mathbb{P}\bigl(t(H,\mathcal{G}(n,p)) \ge q^{e(H)}\bigr) = -(1+o(1))\frac{n^2}{2}\,\mathrm{UT}_p(H,q), \tag{1.1}$$

where $\mathrm{UT}_p(H,q)$, for any graph $H$, is given by the upper tail variational problem:

$$\mathrm{UT}_p(H,q) := \begin{cases} \text{minimize} & \mathbb{E}[I_p(W)] \\ \text{subject to} & t(H,W) \ge q^{e(H)}. \end{cases} \tag{1.2}$$

Here $W$ is taken over all graphons. We shall use $\mathrm{UT}_p(H,q)$ to refer to the variational problem as well as its value, i.e., $\min\{\mathbb{E}[I_p(W)] : t(H,W) \ge q^{e(H)}\}$ (it is known that the minimum is always attained by some $W$; see Lemma 5.1 below).

Furthermore, as shown in [10, Theorem 3.1], the set of minimizing $W$ in $\mathrm{UT}_p(H,q)$ represents the most likely models for $\mathcal{G}(n,p)$ conditioned on the rare event $t(H,\mathcal{G}(n,p)) \ge q^{e(H)}$, in the sense that the random graph conditioned on this rare event is exponentially more likely to be close to the set of minimizers $W$ in terms of cut distance. This motivates the study of $\mathrm{UT}_p(H,q)$ and related variational problems.


We currently have few tools for solving variational problems of the type (1.2). Note that $W = q$ always satisfies the constraint in (1.2). We focus on the basic question: does the constant graphon $W = q$ minimize $\mathrm{UT}_p(H,q)$? The answer depends on the graph $H$ and parameters $(p,q)$. For a fixed $H$, we wish to determine for each $(p,q)$ whether $\mathrm{UT}_p(H,q) = I_p(q)$ or $\mathrm{UT}_p(H,q) < I_p(q)$, and in the former case, whether the constant function $W = q$ is the unique minimizer.^2 The separation of these two cases can be illustrated via a phase diagram, as in Figure 1, by plotting the phases in the $(p,q)$-plane according to the behavior of $\mathrm{UT}_p(H,q)$.

The constant graphon $W = q$ is the limit of random graphs $\mathcal{G}(n,q)$ as $n \to \infty$, so if it were the unique minimizer of $\mathrm{UT}_p(H,q)$ then $\mathcal{G}(n,p)$, conditioned on having $H$-density at least $q^{e(H)}$, approaches the typical $\mathcal{G}(n,q)$ in cut distance; this is not the case when $W = q$ is not a minimizer. Borrowing language from statistical physics, informally, when $W = q$ is a minimizer we say that there is replica symmetry, and otherwise there is symmetry breaking.

In a previous paper with Lubetzky [28], we completely identified the upper tail replica symmetric phase whenever $H$ is a $d$-regular graph. The phase diagram depends only on $d$. The diagram for $H = K_3$ is shown in Figure 1 in the upper portion (i.e., $q > p$) of the diagram.^3 The lower portion of the diagram illustrates new results in this paper concerning the lower tail problem.

^1 We identify graphons differing on a measure zero set, as well as up to a measure-preserving transformation on $[0,1]$, i.e., $W$ is identified with $W^{\sigma}(x,y) := W(\sigma(x),\sigma(y))$ where $\sigma\colon [0,1] \to [0,1]$ is measure-preserving.

^2 There is a subtle issue of uniqueness of the minimizer. When the constant graphon $W = q$ is the unique minimizer, $\mathcal{G}(n,q)$ represents the most likely model for the conditioned random graph (in terms of cut metric). However, it may be the case that $W = q$ is a non-unique minimizer (which provably does not happen for $\mathrm{UT}_p(K_3,q)$ but I suspect that it does happen for the corresponding lower tail problem $\mathrm{LT}_p(K_3,q)$). When there are multiple distinct minimizers to the variational problem, all minimizers give rise to the same exponential rate, but one minimizer might still dominate by a lower order $\exp(o(n^2))$ factor, which I do not know how to discern purely from the variational problem.

Figure 1. The phase diagram for the triangle density upper tail variational problem $\mathrm{UT}_p(K_3,q)$ (when $q > p$) and lower tail variational problem $\mathrm{LT}_p(K_3,q)$ (when $q < p$). In one kind of marked region the constant graphon $W = q$ is the unique minimizer of the variational problem; in the other kind the constant graphon does not minimize the variational problem. The region marked "?" is unresolved. The boundary curves $\underline{q}$ and $\overline{q}$ are from Theorem 2.1.


In this paper we study the corresponding lower tail variational problem. For $0 < q < p < 1$, let

$$\mathrm{LT}_p(H,q) := \begin{cases} \text{minimize} & \mathbb{E}[I_p(W)] \\ \text{subject to} & t(H,W) \le q^{e(H)}. \end{cases} \tag{1.3}$$


The connections between the large deviation problem and the variational problem discussed earlier hold for the lower tail just as they do for the upper tail. For example, as in (1.1), for fixed $0 < q < p < 1$, we have

$$\log \mathbb{P}\bigl(t(H,\mathcal{G}(n,p)) \le q^{e(H)}\bigr) = -(1+o(1))\frac{n^2}{2}\,\mathrm{LT}_p(H,q). \tag{1.4}$$

As observed in [28], if $H$ is a bipartite graph satisfying Sidorenko's conjecture [36], which asserts that $t(H,W) \ge \mathbb{E}[W]^{e(H)}$ for all graphons $W$, then from the constraint of $\mathrm{LT}_p(H,q)$ we deduce $\mathbb{E}[W] \le q$. Since $I_p(\cdot)$ is a convex function that is decreasing on $[0,p]$, it follows from $\mathbb{E}[W] \le q$ and Jensen's inequality that $W = q$ is the unique minimizer of $\mathrm{LT}_p(H,q)$. Sidorenko's conjecture remains open,^4 though it has been proved for certain families of bipartite graphs $H$ such as trees, cycles, hypercubes, and bipartite graphs containing one vertex adjacent to all vertices on the opposite side [11, 14, 21, 23, 37]. Even if Sidorenko's conjecture were false, it could still be true that $W = q$ minimizes $\mathrm{LT}_p(H,q)$ for every bipartite $H$.

^3 The boundary curve for the upper tail phase diagram for $K_3$ is given by the equation $\bigl(1 + (q^{-1} - 1)^{1/(1-2q)}\bigr)p = 1$.

^4 The first unsettled case of Sidorenko's conjecture is for the graph $H$ being $K_{5,5}$ with a Hamiltonian cycle removed (this $H$ is sometimes called a "Möbius strip"). There is some sentiment in the community that Sidorenko's conjecture may be false for this graph.

For the first non-bipartite case, namely $K_3$, new results in this paper partially characterize the lower tail phase diagram, as depicted in Figure 1. The region marked "?" remains unresolved. For other non-bipartite graphs $H$, it is possible to draw similar partially identified phase diagrams using the techniques in this paper. We will pay special attention to the slopes of the boundary curves at the origin.

The lower tail variational problem seems to be harder than the corresponding upper tail problem.^5 By analogy, for the classical extremal graph theory problem of determining the range of possible triangle densities in a graph of fixed edge density, the maximization problem (analogous to upper tail) follows as a corollary of the classic Kruskal-Katona theorem [20, 22],^6 whereas the corresponding minimization problem (analogous to lower tail) was solved only relatively recently by Razborov [34] using his flag algebra machinery (also later solved for $K_4$ by Nikiforov [29] and all $K_t$ by Reiher [35]). Furthermore, the qualitative nature of the phase transition seems to be different for the upper tail and the lower tail. It seems likely that the optimizing graphon $W$ changes continuously as $(p,q)$ crosses the upper tail phase boundary, but discontinuously as it crosses the lower tail phase boundary.

The sparse setting, with $p = p_n \to 0$ and $q/p$ kept constant, is more difficult. Using powerful new methods, Chatterjee and Dembo [7] showed that the large deviation problem in sparse random graphs also reduces to the natural variational problem,^7 provided that $p \ge n^{-\alpha_H}$ for some explicit $\alpha_H > 0$. A similar conclusion can be made about the lower tail variational problem using their techniques. With Lubetzky [27] we obtained the following asymptotic solution to the corresponding variational problem: for every fixed $\delta > 0$,

$$\lim_{p \to 0} \frac{\mathrm{UT}_p\bigl(K_3, (1+\delta)^{1/3}p\bigr)}{p^2\log(1/p)} = \min\Bigl\{\delta^{2/3}, \frac{2\delta}{3}\Bigr\}, \tag{1.5}$$

and as a corollary, as long as $p = p_n \to 0$ with $p_n \ge n^{-1/42}\log n$, we have

$$\mathbb{P}\bigl(t(K_3,\mathcal{G}(n,p)) \ge (1+\delta)p^3\bigr) = \exp\Bigl[-(1-o(1))\min\Bigl\{\frac{\delta^{2/3}}{2}, \frac{\delta}{3}\Bigr\}\, n^2p^2\log\frac{1}{p}\Bigr].$$

In this paper, we also study the lower tail variational problem as $p \to 0$. A nice feature of the lower tail problem in the sparse limit is that instead of being concerned with the entire phase boundary curve, we can focus on its slope at the origin.^8

The lower tail problem was also recently analyzed by Janson and Warnke [17] from a completely different perspective (not relating to the variational problem). In the triangle case, for $n^{-1/2} \ll p \to 0$, they were able to determine the large deviation rate of $\mathbb{P}(t(K_3,\mathcal{G}(n,p)) \le (1-\delta)p^3)$ for the two extremes $\delta = o(1)$ and $\delta = 1 - o(1)$. They left as an open question what happens for fixed $\delta \in (0,1)$, which is the subject of this paper.

There are other variants of the variational problem being studied in the literature. For exponential random graphs, see [28, 33, 39-44]. For the variational problem where several subgraph densities are simultaneously constrained (e.g., edge and triangle densities both fixed), see [2, 19, 30-32].


^5 This is despite the fact that the solution to the probabilistic problem of determining the order of the logarithm of the upper tail probability when $p = p(n) \to 0$ came much later compared to the corresponding lower tail result [15, 16].

^6 The proof of the triangle upper tail result in [28] actually uses some form of strengthening of the Kruskal-Katona result, as we explain in Section 3.

^7 Some minor modifications need to be made to the formulation of the variational problem $\mathrm{LT}_p(H,q)$ in order to match the statements in [7], namely that we only consider graphons that correspond to weighted graphs on $n$ vertices. This difference is minor and does not affect the rest of this paper.

^8 For the upper tail boundary curve, the slope at the origin is always 1.

Section 2 contains statements of the results. Section 3 reviews the techniques used in the proof of the upper tail results from [28]. Section 4 concerns the lower tail problem for triangle densities. Section 5 concerns general $H$-densities. The methods in Sections 4 and 5 are different since the techniques for triangles seem to be quantitatively superior but do not extend to all graphs. Section 6 concludes with some open problems.


2. Results 


2.1. Triangle density. Here is our main result concerning the lower tail variational problem $\mathrm{LT}_p(K_3,q)$ for triangle densities. See Figure 1.

Theorem 2.1. There exist functions $\underline{q}, \overline{q}\colon (0,1) \to (0,1)$ satisfying $0 < \underline{q}(p) < \overline{q}(p) < p$ for $0 < p < 1$ with the following properties. Whenever $\overline{q}(p) \le q < p$, the constant graphon $W = q$ is the unique minimizer for $\mathrm{LT}_p(K_3,q)$. Whenever $0 < q < \underline{q}(p)$, the constant graphon $W = q$ does not minimize $\mathrm{LT}_p(K_3,q)$. Furthermore, $\lim_{p \to 0} \underline{q}(p)/p = 0.209\ldots$ while $\lim_{p \to 0} \overline{q}(p)/p = 0.466\ldots$.

The two curves $\underline{q}(p)$ and $\overline{q}(p)$ are drawn in Figure 1. The nature of $\mathrm{LT}_p(K_3,q)$ remains unresolved for $(p,q)$ between these two curves.

In Theorem 2.1 and elsewhere, $0.466\ldots$ denotes the unique $0 < r < 1$ satisfying $\frac{3}{2} r\log r - r + 1 = 0$, and $0.209\ldots$ is defined as the maximum value of $r < 1$ such that the function $f_r(x)$ in (4.8) (also see Figure 4) has a zero in the open interval $(0,r)$.
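The constant $0.466\ldots$ can be reproduced numerically. In terms of $h(x) = x\log x - x + 1$ from Section 2.3, its defining equation is $h(r) + \frac12 r h'(r) = 0$ (see Fact 4.5), i.e., $\frac32 r\log r - r + 1 = 0$. The sketch below locates the root by bisection; the helper `bisect` and the bracketing interval $[0.1, 0.7]$ are choices made for this illustration only.

```python
import math

def bisect(g, lo, hi, iters=100):
    """Root of g in [lo, hi], assuming g(lo) and g(hi) have opposite signs."""
    g_lo = g(lo)
    for _ in range(iters):
        mid = (lo + hi) / 2
        g_mid = g(mid)
        if (g_mid > 0) == (g_lo > 0):
            lo, g_lo = mid, g_mid   # root lies in the upper half
        else:
            hi = mid                # root lies in the lower half
    return (lo + hi) / 2

def g(r):
    # h(r) + (1/2) r h'(r) with h(x) = x log x - x + 1
    return 1.5 * r * math.log(r) - r + 1

# g(0.1) > 0 > g(0.7), so the root in (0, 1) is bracketed by [0.1, 0.7].
r_bar = bisect(g, 0.1, 0.7)
```

Note that $r = 1$ also satisfies the equation, which is why the statement restricts to the open interval $(0,1)$ and why the bracket above stays away from 1.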


2.2. General subgraph density. We extend Theorem 2.1 to general subgraph counts. No serious effort is made here at optimizing the quantitative bounds.

Theorem 2.2. Let $H$ be a graph. There exists a function $\overline{q}\colon (0,1) \to (0,1)$ with $\lim_{p \to 0} \overline{q}(p)/p < 1$ such that whenever $\overline{q}(p) \le q < p$, the constant graphon $W = q$ is the unique minimizer for $\mathrm{LT}_p(H,q)$.

Furthermore, if $H$ is not bipartite, then there exists a function $\underline{q}\colon (0,1) \to (0,1)$ with $\lim_{p \to 0} \underline{q}(p)/p > 0$ such that whenever $0 < q < \underline{q}(p)$, the constant graphon $W = q$ does not minimize $\mathrm{LT}_p(H,q)$.

The proof of the triangle case, Theorem 2.1, makes use of Goodman's inequality [13]:

$$t(K_3,W) + t(K_3,1-W) \ge 1/4.$$

If $H$ satisfies $t(H,W) + t(H,1-W) \ge 2^{-e(H)+1}$ for all graphons $W$ (such a graph $H$ is sometimes called "common" in the context of Ramsey multiplicities), then the same method can be used to establish regions where $W = q$ is a minimizer of $\mathrm{LT}_p(H,q)$ (though the actual regions will not be the same as in Figure 1 due to other technical reasons). However, $t(H,W) + t(H,1-W) \ge 2^{-e(H)+1}$ does not hold in general. For example, Thomason [38] showed that $K_t$ is a counterexample for all $t \ge 4$. Consequently, the proof method of Theorem 2.1 does not seem to extend to all $H$. Theorem 2.2 for general $H$ is proved using a different method, which seems quantitatively inferior to the method for triangles.

For bipartite $H$, I conjecture that there is no phase transition:

Conjecture 2.3. Let $H$ be a bipartite graph. Then the constant function $W = q$ is always the unique minimizer of $\mathrm{LT}_p(H,q)$.

As mentioned in the introduction, the conjecture holds for any $H$ for which Sidorenko's conjecture is true, i.e., $t(H,W) \ge \mathbb{E}[W]^{e(H)}$ for all graphons $W$. Conjecture 2.3 may be true even if Sidorenko's conjecture were not.

2.3. Sparse limit. Since $I_p$ is decreasing on $[0,p]$ and increasing on $[p,1]$, any minimizing $W$ for $\mathrm{LT}_p(H,q)$ satisfies $0 \le W \le p$ almost everywhere. Let

$$h(x) := x\log x - x + 1$$

so that

$$\lim_{p \to 0} p^{-1} I_p(px) = h(x)$$

uniformly for $x \in [0,1]$. It follows that for every graph $H$ and $0 < r < 1$ we have

$$\lim_{p \to 0} p^{-1}\,\mathrm{LT}_p(H,pr) = \mathrm{LT}(H,r) \tag{2.1}$$

where

$$\mathrm{LT}(H,r) := \begin{cases} \text{minimize} & \mathbb{E}[h(W)] \\ \text{subject to} & t(H,W) \le r^{e(H)}. \end{cases} \tag{2.2}$$
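The uniform convergence $p^{-1}I_p(px) \to h(x)$ behind the sparse limit is easy to check numerically. The following sketch (function names and grid size are assumptions of this illustration) computes the largest gap $|p^{-1}I_p(px) - h(x)|$ over a grid of $x \in [0,1]$ for a decreasing sequence of $p$.

```python
import math

def xlogx(x):
    """x log x with the convention 0 log 0 = 0."""
    return 0.0 if x == 0 else x * math.log(x)

def I(p, x):
    """Relative entropy I_p(x) = x log(x/p) + (1-x) log((1-x)/(1-p))."""
    return (xlogx(x) - x * math.log(p)
            + xlogx(1 - x) - (1 - x) * math.log(1 - p))

def h(x):
    return xlogx(x) - x + 1

def max_gap(p, grid=1000):
    """Largest |I_p(px)/p - h(x)| over a uniform grid of x in [0, 1]."""
    return max(abs(I(p, p * i / grid) / p - h(i / grid))
               for i in range(grid + 1))

gaps = [max_gap(p) for p in (1e-1, 1e-2, 1e-3, 1e-4)]
```

A Taylor expansion of $\log\frac{1-px}{1-p}$ suggests that the gap is of order $p$, which is consistent with the values in `gaps` shrinking by roughly a factor of ten at each step.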


It would be interesting to solve this variational problem. As before, a basic question is whether the 
constant function W = r is a minimizer. Here is the main conjecture. 


Conjecture 2.4. Let $H$ be a non-bipartite graph and $0 < r < 1$. There exists a $0 < r_H^* < 1$ so that $W = r$ minimizes $\mathrm{LT}(H,r)$ if and only if $r \ge r_H^*$. Furthermore, $W = r$ is the unique minimizer for $\mathrm{LT}(H,r)$ if and only if $r > r_H^*$.

The conjecture remains open for any non-bipartite graph $H$. For the bipartite case:

Conjecture 2.5. The constant graphon $W = r$ is the unique minimizer for $\mathrm{LT}(H,r)$ for every bipartite graph $H$ and every $0 < r < 1$.


In proving Theorem 2.1 and Theorem 2.2 we obtain the following results in the direction of the 
above conjectures. 


Theorem 2.6. If $0.466\ldots \le r < 1$, then the constant graphon $W = r$ uniquely minimizes $\mathrm{LT}(K_3,r)$. If $0 < r < 0.209\ldots$, then the constant graphon $W = r$ does not minimize $\mathrm{LT}(K_3,r)$.

I conjecture that, in Conjecture 2.4, $r^*_{K_3} = 0.209\ldots$.

Theorem 2.7. Let $H$ be a graph. There exists $\overline{r}_H < 1$ such that $W = r$ uniquely minimizes $\mathrm{LT}(H,r)$ whenever $\overline{r}_H \le r < 1$. If $H$ is nonbipartite, then there exists $\underline{r}_H > 0$ such that $W = r$ does not minimize $\mathrm{LT}(H,r)$ for $0 < r < \underline{r}_H$.


Combining these results with the framework of Chatterjee and Dembo [7], we obtain

Corollary 2.8. Let $H$ be a graph. There is some explicit $\alpha_H > 0$ so that for $p = p_n \to 0$ with $p \ge n^{-\alpha_H}$, the following large deviation results hold.

There exists $\overline{r}_H < 1$ so that for all $r \in (\overline{r}_H, 1)$,

$$\lim_{n \to \infty} \frac{2}{n^2 p}\log \mathbb{P}\bigl(t(H,\mathcal{G}(n,p)) \le (rp)^{e(H)}\bigr) = -h(r).$$

If $H$ is non-bipartite, then there exists $\underline{r}_H > 0$ so that for all $r \in (0, \underline{r}_H)$,

$$\liminf_{n \to \infty} \frac{2}{n^2 p}\log \mathbb{P}\bigl(t(H,\mathcal{G}(n,p)) \le (rp)^{e(H)}\bigr) > -h(r).$$

For $H = K_3$, we may take $\overline{r}_{K_3} = 0.466\ldots$ and $\underline{r}_{K_3} = 0.209\ldots$.

3. Review of the proof for triangle upper tails

We begin with a quick review of the proof of the upper tail result from [28], as some of the ideas are used in the proof of Theorem 2.1.

The following extension of Hölder's inequality is very useful. See [28, Corollary 3.2].

Proposition 3.1. Let $H$ be a graph with maximum degree $\Delta$. For any symmetric measurable function $W\colon [0,1]^2 \to \mathbb{R}$, we have $t(H,W) \le \mathbb{E}[|W|^{\Delta}]^{e(H)/\Delta}$. In particular, $t(K_3,W) \le \mathbb{E}[W^2]^{3/2}$.

The inequality can be proved via repeated applications of Hölder's inequality (when $H = K_3$, it takes three applications of the Cauchy-Schwarz inequality). Observe that the inequality $t(K_3,W) \le \mathbb{E}[W^2]^{3/2}$ strengthens a corollary of the Kruskal-Katona theorem on the maximum possible triangle density in a graph of given edge density: $t(K_3,W) \le \mathbb{E}[W]^{3/2}$.
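As a numerical sanity check of the triangle case of Proposition 3.1 (not a proof), one can sample random symmetric step functions with values in $[-1,1]$ and compare both sides of $t(K_3,W) \le \mathbb{E}[W^2]^{3/2}$. The helper names, block count, and sampling scheme below are assumptions made for this illustration.

```python
import random

def triangle_density(A):
    """t(K_3, W) for the step function with symmetric k x k matrix A."""
    k = len(A)
    s = sum(A[i][j] * A[j][l] * A[i][l]
            for i in range(k) for j in range(k) for l in range(k))
    return s / k ** 3

def second_moment(A):
    """E[W^2] for the same step function."""
    k = len(A)
    return sum(A[i][j] ** 2 for i in range(k) for j in range(k)) / k ** 2

def random_symmetric(k, rng):
    """Symmetric k x k matrix with independent uniform entries in [-1, 1]."""
    A = [[0.0] * k for _ in range(k)]
    for i in range(k):
        for j in range(i, k):
            A[i][j] = A[j][i] = rng.uniform(-1.0, 1.0)
    return A

rng = random.Random(0)
# Largest observed value of t(K_3, W) - E[W^2]^(3/2) over the samples;
# Proposition 3.1 says it can never be positive.
worst_gap = max(
    triangle_density(A) - second_moment(A) ** 1.5
    for A in (random_symmetric(4, rng) for _ in range(200))
)
```

For step functions the inequality reduces to the matrix fact that the trace of $A^3$ is at most the cube of the Frobenius norm of $A$, so the check is expected to pass up to rounding.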

The following result from [28] gives the full replica symmetric phase for $\mathrm{UT}_p(K_3,q)$, the upper tail problem for triangle densities.

Theorem 3.2. Let $0 < p \le q < 1$. If the point $(q^2, I_p(q))$ lies on the convex minorant of the function $x \mapsto I_p(\sqrt{x})$, then $W = q$ is the unique minimizer of $\mathrm{UT}_p(K_3,q)$.

The upper tail boundary curve in Figure 1 is characterized by the condition in Theorem 3.2. See [28, Lemma 3.1] for the proof of symmetry breaking, i.e., $\mathrm{UT}_p(K_3,q) < I_p(q)$, to the left of the boundary curve.

Proof. By the convex minorant condition, the tangent line to the function $I_p(\sqrt{x})$ at $x = q^2$ lies below the function, so that

$$I_p(\sqrt{x}) \ge I_p(q) + \frac{I_p'(q)}{2q}(x - q^2), \quad \forall x \in [0,1].$$

Replacing $x$ by $x^2$, we get

$$I_p(x) \ge I_p(q) + \frac{I_p'(q)}{2q}(x^2 - q^2), \quad \forall x \in [0,1]. \tag{3.1}$$

Note that $I_p'(q) \ge 0$ since $q \ge p$.

Suppose a graphon $W$ satisfies $t(K_3,W) \ge q^3$. By Proposition 3.1, we have $\mathbb{E}[W^2] \ge t(K_3,W)^{2/3} \ge q^2$. Thus (3.1) implies that

$$\mathbb{E}[I_p(W)] \ge I_p(q) + \frac{I_p'(q)}{2q}\bigl(\mathbb{E}[W^2] - q^2\bigr) \ge I_p(q).$$

This shows that $W = q$ is a minimizer for $\mathrm{UT}_p(K_3,q)$, and furthermore it is not too hard to check equality conditions to verify that this is the unique minimizer. ■


4. Triangle lower tails

In this section we prove Theorems 2.1 and 2.6.

4.1. Replica symmetry phase. We begin with a small modification of Goodman's theorem [13] (which is usually stated for $U + W = 1$).

Lemma 4.1. If $U$ and $W$ are graphons such that $U + W \ge 2q$ for some constant $q > 0$, then

$$t(K_3,W) + t(K_3,U) \ge 2q^3.$$

Proof. By decreasing $U$ and $W$ (while remaining nonnegative), we may assume that they are graphons satisfying $U + W = 2q$. Let $U = q + X$ and $W = q - X$ for some symmetric measurable function $X\colon [0,1]^2 \to \mathbb{R}$. Then

$$t(K_3,W) + t(K_3,U) = t(K_3,q+X) + t(K_3,q-X) = 2q^3 + 6q\,t(K_{1,2},X) \ge 2q^3 + 6q(\mathbb{E}[X])^2 \ge 2q^3. \quad ■$$
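The identity $t(K_3,q+X) + t(K_3,q-X) = 2q^3 + 6q\,t(K_{1,2},X)$ in the proof comes from expanding both products and noting that the terms of odd degree in $X$ cancel. It can be checked numerically on a random step function; in the sketch below the block count `k`, the value of `q`, and all helper names are hypothetical choices for illustration.

```python
import random

def triangle(W):
    """t(K_3, W) for a symmetric k x k step function W."""
    k = len(W)
    s = sum(W[x][y] * W[x][z] * W[y][z]
            for x in range(k) for y in range(k) for z in range(k))
    return s / k ** 3

def cherry(X):
    """t(K_{1,2}, X): the integral of X(x,y) X(x,z) over [0,1]^3."""
    k = len(X)
    return sum(sum(row) ** 2 for row in X) / k ** 3

rng = random.Random(1)
k, q = 4, 0.4
X = [[0.0] * k for _ in range(k)]
for i in range(k):
    for j in range(i, k):
        X[i][j] = X[j][i] = rng.uniform(-q, q)   # keeps q + X and q - X in [0, 1]

U = [[q + X[i][j] for j in range(k)] for i in range(k)]
W = [[q - X[i][j] for j in range(k)] for i in range(k)]
mean_X = sum(map(sum, X)) / k ** 2

lhs = triangle(U) + triangle(W)
rhs = 2 * q ** 3 + 6 * q * cherry(X)
```

The last step of the proof, $t(K_{1,2},X) \ge (\mathbb{E}[X])^2$, is the Cauchy-Schwarz inequality applied to the row averages of $X$, and can be compared against `mean_X ** 2` in the same way.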


For any $a \in \mathbb{R}$ we write $a_+ := \max\{a,0\}$. In the proposition below, $a_+^2$ means $(a_+)^2$. The inequality (4.1) below is motivated by considering the tangent line to $x \mapsto I_p(2q - \sqrt{x})$ at $x = q^2$, as in the proof of Theorem 3.2.

Proposition 4.2. Let $0 < q \le p < 1$ be such that

$$I_p(x) \ge I_p(q) + \frac{-I_p'(q)}{2q}\bigl((2q-x)_+^2 - q^2\bigr) \quad \forall x \in [0,p]. \tag{4.1}$$

Then $W = q$ is the unique minimizer of $\mathrm{LT}_p(K_3,q)$.

Proof. Suppose $W$ satisfies $t(K_3,W) \le q^3$. Apply Lemma 4.1 to $W$ and $U := (2q - W)_+$ to obtain

$$t(K_3,(2q-W)_+) \ge 2q^3 - t(K_3,W) \ge q^3.$$

Next, apply Proposition 3.1 and we obtain

$$\mathbb{E}[(2q-W)_+^2] \ge t(K_3,(2q-W)_+)^{2/3} \ge q^2.$$

By (4.1) we have (note that $I_p'(q) \le 0$ as $q \le p$)

$$\mathbb{E}[I_p(W)] \ge I_p(q) + \frac{-I_p'(q)}{2q}\bigl(\mathbb{E}[(2q-W)_+^2] - q^2\bigr) \ge I_p(q).$$

It follows that $\mathrm{LT}_p(K_3,q) = I_p(q)$. To show that $W = q$ is the unique minimizer, observe that in order for any other $W$ to be a minimizer, equality must occur at every step above. In particular, if (4.1) has a single point of equality, namely $x = q$, then the uniqueness of $W$ is clear. Otherwise, one can check (details omitted, but see Figure 2) that (4.1) has at most two points of equality, with one being $x = q$, so that if $W$ has any positive mass with value being the other point of equality, then it would be impossible for $t(K_3,W) = q^3$ to hold. This shows that $W = q$ is the unique minimizer. ■


To derive results about the phase diagram, we shall invoke various technical statements (referred 



Proof. Let

$$f(x) := f_{p,q}(x) := I_p(x) - I_p(q) + \frac{I_p'(q)}{2q}\bigl((2q-x)_+^2 - q^2\bigr). \tag{4.2}$$

We plotted $f$ for some representative values of $(p,q)$ in Figure 2.

Suppose $f(p) \ge 0$. We have

$$f'(x) = I_p'(x) - \frac{I_p'(q)}{q}(2q-x)_+$$

and

$$f''(x) = I_p''(x) + \frac{I_p'(q)}{q}1_{x<2q} = \frac{1}{x(1-x)} + \frac{I_p'(q)}{q}1_{x<2q}.$$

Since $p \le 1/2$, $f''(x)$ is decreasing for $0 < x < \min\{p,2q\}$. Clearly $f''$ is positive near $x = 0$. We consider two cases.

Case I: $f''(x) \ge 0$ for all $0 < x < \min\{p,2q\}$. Then $f$ is convex on $(0,\min\{p,2q\})$. We know that $f(q) = f'(q) = 0$. Then $f(x) \ge 0$ for all $x \in [0,\min\{p,2q\}]$. If $2q \ge p$, then we are done. Otherwise, note that $f(x) = I_p(x) - I_p(q) - qI_p'(q)/2$ for $x \in [2q,p]$, and it is decreasing on this interval. Since we assumed that $f(p) \ge 0$, we obtain $f(x) \ge 0$ for all $x \in [0,p]$.

Case II: there is some $x_0 \in (0,\min\{p,2q\})$ such that $f''(x_0) = 0$. So $f$ is convex on $(0,x_0)$ and concave on $(x_0,\min\{p,2q\})$. We assumed that $f(p) \ge 0$, so $f(\min\{p,2q\}) \ge 0$ since if $2q < p$ then $f$ is decreasing on $(2q,p)$. Since $f(q) = f'(q) = 0$, an analysis of the convexity of $f$ shows that it is nonnegative on $[0,p]$. ■

Figure 2. Plots of $f_{p,q}$ from (4.2) for $p = 0.1$ and various values of $q$.


For the sparse limit $p \to 0$, the proof of the first half of Theorem 2.6 is nearly identical. It follows from the following two propositions, whose proofs we omit.

Proposition 4.4. Let $0 < r < 1$ be such that

$$h(x) \ge h(r) + \frac{-h'(r)}{2r}\bigl((2r-x)_+^2 - r^2\bigr) \quad \forall x \in [0,1]. \tag{4.3}$$

Then $W = r$ is the unique minimizer of $\mathrm{LT}(K_3,r)$.

Fact 4.5. The inequality (4.3) holds for all $x \in [0,1]$ if and only if it holds for $x = 1$, which holds if and only if $r \ge \overline{r} = 0.466\ldots$, where $\overline{r}$ satisfies $h(\overline{r}) + \frac12 \overline{r}h'(\overline{r}) = 0$.


4.2. Symmetry breaking phase. Now we explain the lower curve $\underline{q}$ in Figure 1. It is obtained by restricting the variational problem $\mathrm{LT}_p(K_3,q)$ to graphons $W$ of the form $\mathrm{BIP}_{a,b}$, where $\mathrm{BIP}_{a,b}$, for $0 \le a,b \le 1$, is defined by

$$\mathrm{BIP}_{a,b}(x,y) := \begin{cases} a & \text{if } (x,y) \in [0,1/2]^2 \cup (1/2,1]^2, \\ b & \text{if } (x,y) \in [0,1/2] \times (1/2,1] \cup (1/2,1] \times [0,1/2]. \end{cases} \tag{4.4}$$

There is symmetry breaking as long as we can find $0 \le a,b \le p$ satisfying

$$\mathbb{E}[I_p(\mathrm{BIP}_{a,b})] = \tfrac12 I_p(a) + \tfrac12 I_p(b) < I_p(q) \tag{4.5}$$

and

$$t(K_3,\mathrm{BIP}_{a,b}) = \tfrac14 a^3 + \tfrac34 ab^2 \le q^3. \tag{4.6}$$

We can assume that $0 \le a \le q \le b \le p$, since otherwise swapping $a$ and $b$ reduces $t(K_3,W)$ (observe that $t(K_3,\mathrm{BIP}_{a,b}) - t(K_3,\mathrm{BIP}_{b,a}) = \frac14(a-b)^3$) while keeping $\mathbb{E}[I_p(W)]$ constant.

Set $b = \sqrt{(4q^3 - a^3)/(3a)}$ so that $t(K_3,\mathrm{BIP}_{a,b}) = q^3$. There is symmetry breaking if

$$f(x) := f_{p,q}(x) := \frac12 I_p(x) + \frac12 I_p\Bigl(\sqrt{\frac{4q^3 - x^3}{3x}}\Bigr) - I_p(q) \tag{4.7}$$

is negative for some $0 < x < q$, where $f$ is only defined for $\sqrt{(4q^3 - x^3)/(3x)} \le p$. Some representative examples of $f$ are plotted in Figure 3. For every $p$ and sufficiently small $q$, $f(x)$ becomes negative in a region away from $x = q$.

Figure 3. The plot of $f_{p,q}$ from (4.7) for $p = 0.1$ and various values of $q$.

Now we prove the claims in Theorem 2.1 more rigorously. For every $p > 0$, if $q$ is sufficiently small so that $\frac12 I_p(0) < I_p(q)$, then $W = \mathrm{BIP}_{0,p}$ satisfies $t(K_3,W) = 0$ while $\mathbb{E}[I_p(W)] = \frac12 I_p(0) < I_p(q)$, so that $\mathrm{LT}_p(K_3,q) < I_p(q)$.

The argument in the previous paragraph does not give the optimal $\underline{q}$ in Theorem 2.1. To prove that $\underline{q}$ can be chosen so that $\lim_{p \to 0} \underline{q}(p)/p = 0.209\ldots$, it suffices, by (2.1), to prove the second half of Theorem 2.6, that $\mathrm{LT}(K_3,r) < h(r)$ for all $r < r_1 = 0.209\ldots$. As before, we seek $0 \le a \le r \le b \le 1$ with

$$\tfrac12 h(a) + \tfrac12 h(b) < h(r)$$

and

$$\tfrac14 a^3 + \tfrac34 ab^2 \le r^3.$$

Let

$$f(x) := f_r(x) := \frac12 h(x) + \frac12 h\Bigl(\sqrt{\frac{4r^3 - x^3}{3x}}\Bigr) - h(r). \tag{4.8}$$
See Figure 4 for some examples of plots of $f_r$ (as before, plotted for values of $x \le r$ satisfying $\sqrt{(4r^3 - x^3)/(3x)} \le 1$). At the critical $r = r_1 = 0.209\ldots$, there exists $0 < a_1 < r_1 < b_1 \le 1$ such that $\frac14 a_1^3 + \frac34 a_1b_1^2 = r_1^3$ and $\frac12 h(a_1) + \frac12 h(b_1) = h(r_1)$. Now for any $0 < r < r_1$, let $s = r/r_1$, so that $(a,b) = (sa_1,sb_1)$ satisfies $\frac14 a^3 + \frac34 ab^2 = r^3$. Note that

$$h(sx) = sx\log(sx) - sx + 1 = s(x\log x - x + 1) + (s\log s)x - s + 1 = sh(x) + (s\log s)x - s + 1.$$

We have

$$\tfrac12 h(a) + \tfrac12 h(b) - h(r) = \tfrac12 h(sa_1) + \tfrac12 h(sb_1) - h(sr_1) = s\bigl(\tfrac12 h(a_1) + \tfrac12 h(b_1) - h(r_1)\bigr) + (s\log s)\bigl(\tfrac12 a_1 + \tfrac12 b_1 - r_1\bigr) < 0,$$

where the first term vanishes by the choice of $(a_1,b_1)$, $s\log s < 0$ for $0 < s < 1$, and we know $(a_1 + b_1)/2 > r_1$ from

$$\bigl(\tfrac12 a_1 + \tfrac12 b_1\bigr)^3 - r_1^3 = \bigl(\tfrac12 a_1 + \tfrac12 b_1\bigr)^3 - \tfrac14 a_1^3 - \tfrac34 a_1b_1^2 = \bigl(\tfrac12 b_1 - \tfrac12 a_1\bigr)^3 > 0.$$

It follows that $\mathrm{LT}(K_3,r) < h(r)$ for all $0 < r < r_1 = 0.209\ldots$.
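To see symmetry breaking concretely in the sparse limit, one can evaluate $f_r$ from (4.8) on a grid and look at its smallest value. The sketch below does this for $r = 0.15$, which lies below $0.209\ldots$; the grid size and the function names are assumptions of this illustration, and the code does not attempt to locate the critical value $r_1$ itself.

```python
import math

def h(x):
    """h(x) = x log x - x + 1, with h(0) = 1."""
    return (x * math.log(x) if x > 0 else 0.0) - x + 1

def f(r, x):
    """f_r(x) from (4.8), or None where b = sqrt((4r^3 - x^3)/(3x)) exceeds 1."""
    b = math.sqrt((4 * r ** 3 - x ** 3) / (3 * x))
    if b > 1:
        return None
    return 0.5 * h(x) + 0.5 * h(b) - h(r)

def min_f(r, grid=10000):
    """Smallest value of f_r over grid points x in (0, r) where it is defined."""
    values = (f(r, r * i / grid) for i in range(1, grid))
    return min(v for v in values if v is not None)

m_small = min_f(0.15)
```

A negative value of `m_small` exhibits a pair $(a,b)$ with $\frac14 a^3 + \frac34 ab^2 = r^3$ and $\frac12 h(a) + \frac12 h(b) < h(r)$, so that $\mathrm{BIP}_{a,b}$ beats the constant graphon at $r = 0.15$. For instance, taking $a$ close to $4r^3/3$ forces $b$ close to 1, where $h(b)$ is nearly zero.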


5. General subgraph lower tails

In this section we prove Theorems 2.2 and 2.7. I will give the details only for Theorem 2.7 concerning the sparse limit $\mathrm{LT}(H,r)$, as it is somewhat cleaner and contains all the ideas. Theorem 2.2 regarding $\mathrm{LT}_p(H,q)$ can be proved analogously by considering sufficiently small but fixed values of $p$.


5.1. Replica symmetry. For any graph $H$ and graphon $W$, we define the functional derivative $t'(H,W)$ to be the symmetric measurable function given by

$$t'(H,W) := \sum_{e \in E(H)} t_e(H,W) \tag{5.1}$$

where for each $ab \in E(H)$, we define the graphon

$$t_{ab}(H,W)(x_a,x_b) := \int_{[0,1]^{V(H)\setminus\{a,b\}}} \prod_{ij \in E(H)\setminus\{ab\}} W(x_i,x_j) \prod_{i \in V(H)\setminus\{a,b\}} dx_i. \tag{5.2}$$

For example,

$$t'(K_3,W)(x,y) = 3\int_{[0,1]} W(x,z)W(y,z)\,dz.$$

For any symmetric measurable $U\colon [0,1]^2 \to [-1,1]$, and $\delta \to 0$, we have

$$t(H,W+\delta U) = t(H,W) + \delta\,\mathbb{E}[t'(H,W)U] + O(\delta^2),$$

which justifies calling $t'(H,W)$ the functional derivative.

Lemma 5.1. Let $H$ be a graph and $0 < r < 1$. The variational problem $\mathrm{LT}(H,r)$ attains its minimum for some graphon $W$, and any such $W$ satisfies the following Lagrange multiplier condition: for some $\lambda \ge 0$, one has

$$h'(W(x,y)) + \lambda t'(H,W)(x,y) = 0, \quad \text{a.e.-}(x,y) \in [0,1]^2.$$

Proof. That the minimum of $\mathrm{LT}(H,r)$ is always attained follows from the compactness of the space of graphons with respect to the cut distance and the convexity of $h$, as was already observed in [10].^9 Suppose $W$ minimizes $\mathrm{LT}(H,r)$. We claim that for any symmetric measurable function $U\colon [0,1]^2 \to [-1,1]$ such that $0 \le W + U \le 1$ and $\mathbb{E}[t'(H,W)U] < 0$, one has $\mathbb{E}[h'(W)U] \ge 0$. Indeed, consider the graphon $W + \delta U$ for $\delta \searrow 0$. One has

$$t(H,W+\delta U) - t(H,W) = \delta\,\mathbb{E}[t'(H,W)U] + O(\delta^2).$$

Thus $t(H,W+\delta U) < t(H,W) \le r^{e(H)}$ for sufficiently small $\delta > 0$, and hence $\mathbb{E}[h(W+\delta U)] \ge \mathbb{E}[h(W)]$ since $W$ minimizes $\mathrm{LT}(H,r)$. On the other hand,

$$\lim_{\delta \to 0} \mathbb{E}\Bigl[\frac{h(W+\delta U) - h(W)}{\delta}\Bigr] = \mathbb{E}[h'(W)U],$$

so that $\mathbb{E}[h'(W)U] \ge 0$ as claimed. The interchange of limit and expectation above can be justified by writing $U = U_+ - U_-$, where $U_+ = \max\{U,0\}$ and $U_- = \max\{-U,0\}$. Since $h$ is convex, $(h(W+\delta U_+) - h(W))/\delta$ is pointwise monotonically decreasing as $\delta \searrow 0$, and likewise $(h(W-\delta U_-) - h(W))/\delta$ is pointwise monotonically increasing. So the interchange of limit and expectation follows from the monotone convergence theorem.

The lemma follows easily from the claim we just proved. ■


Lemma 5.2. Let $H$ be a graph with $m$ edges, and $0 < r < 1$. Let $W$ minimize $\mathrm{LT}(H,r)$. Then $W \ge r^{mr^{-m}}$ almost everywhere.

Proof. Let $c = r^{mr^{-m}}$. Suppose on the contrary that $W < c$ on a set of positive measure. Let $\lambda$ be the Lagrange multiplier in Lemma 5.1. From (5.1) we have $t'(H,W) \le m$ everywhere. By considering a positive-measure set of $(x,y)$ such that $W(x,y) < c$, we find

$$0 = h'(W(x,y)) + \lambda t'(H,W)(x,y) < h'(c) + m\lambda.$$

So that

$$\lambda > \frac{-h'(c)}{m} = \frac{\log(1/c)}{m}.$$

Therefore, up to a set of measure zero, for every $(x,y)$ with $W(x,y) \ge r^m$, we have

$$t'(H,W)(x,y) = \frac{-h'(W(x,y))}{\lambda} < \frac{-mh'(r^m)}{\log(1/c)} = \frac{m\log(r^{-m})}{\log(r^{-mr^{-m}})} = mr^m.$$

On the other hand, for every $(x,y)$ with $W(x,y) < r^m$, we have $t'(H,W)(x,y)W(x,y) < mr^m$. Thus $t'(H,W)W < mr^m$ almost everywhere. By (5.1) and (5.2), we have

$$t(H,W) = \frac{1}{m}\mathbb{E}[t'(H,W)W] < r^m.$$

However, any $W$ with $t(H,W) < r^m$ cannot minimize $\mathrm{LT}(H,r)$. This gives the desired contradiction. ■


Lemma 5.3. If $t(H,W) \le r^{e(H)}$, then $\mathbb{E}[\log W] \le \log r$.

Proof. Write $m = e(H)$. The lemma follows from Jensen's inequality:

$$m\,\mathbb{E}[\log W] = \int_{[0,1]^{V(H)}} \log\Bigl(\prod_{ij \in E(H)} W(x_i,x_j)\Bigr) \prod_{i \in V(H)} dx_i \le \log\Bigl(\int_{[0,1]^{V(H)}} \prod_{ij \in E(H)} W(x_i,x_j) \prod_{i \in V(H)} dx_i\Bigr) = \log t(H,W) \le m\log r. \quad ■$$

^9 We sketch here an alternate proof of the fact that the minimum is always attained. Let $W_n$ be a sequence of graphons with $t(H,W_n) \le r^{e(H)}$ and $\mathbb{E}[h(W_n)] \to \mathrm{LT}(H,r)$. By compactness of the space of graphons [26], there exists a subsequential limit $W$ so that $\delta_\square(W_n,W) \to 0$ along some subsequence. Restrict to this subsequence. We have $t(H,W_n) \to t(H,W)$, so that $t(H,W) \le r^{e(H)}$. It remains to show that $\mathbb{E}[h(W)] \le \lim \mathbb{E}[h(W_n)] = \mathrm{LT}(H,r)$. We do not lose anything by assuming that $\|W_n - W\|_\square \to 0$. Let $\mathcal{P}_m$ denote the partition of the unit interval $[0,1]$ into $m$ equal-length intervals. Let $W_{\mathcal{P}_m}$ denote $W$ with its value inside each $I_i \times I_j$ replaced by its average, for every $I_i, I_j \in \mathcal{P}_m$. Define $(W_n)_{\mathcal{P}_m}$ similarly. Then $\|W_n - W\|_\square \to 0$ implies that $(W_n)_{\mathcal{P}_m} \to W_{\mathcal{P}_m}$ pointwise a.e. as $n \to \infty$. By convexity, we have $\lim_{n \to \infty} \mathbb{E}[h(W_n)] \ge \liminf_{n \to \infty} \mathbb{E}[h((W_n)_{\mathcal{P}_m})] = \mathbb{E}[h(W_{\mathcal{P}_m})]$. Furthermore, $W_{\mathcal{P}_m} \to W$ pointwise a.e. by the Lebesgue density theorem, so $\lim_{m \to \infty} \mathbb{E}[h(W_{\mathcal{P}_m})] = \mathbb{E}[h(W)]$. It follows that $\mathbb{E}[h(W)] \le \lim \mathbb{E}[h(W_n)] = \mathrm{LT}(H,r)$, as desired.

m      3      4      5      6      7      8      9      10     20     100
r_m    0.686  0.735  0.770  0.795  0.815  0.831  0.844  0.855  0.911  0.973

Table 1. Some values of $r_m$ from Proposition 5.4.

Proposition 5.4. Let $H$ be a graph with $m \ge 1$ edges. Let $r = r_m$ (see Table 1) be the unique solution in the interval $(0,1)$ to the equation

$$h(r^{mr^{-m}}) = h(r) + rh'(r)\bigl(\log(r^{mr^{-m}}) - \log r\bigr).$$

Then $\mathrm{LT}(H,r)$ is uniquely minimized by the constant function $W = r$ for all $r \ge r_m$.
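The thresholds $r_m$ can be recomputed from the defining equation by bisection. The sketch below is a rough numerical aid rather than part of the proof: it assumes that the difference of the two sides changes sign exactly once on the bracket $[0.3, 0.99]$, and the names `gap` and `r_m` are chosen for this illustration.

```python
import math

def h(x):
    """h(x) = x log x - x + 1, with h(0) = 1."""
    return (x * math.log(x) if x > 0 else 0.0) - x + 1

def gap(r, m):
    """Left-hand side minus right-hand side of the equation defining r_m."""
    log_c = m * r ** (-m) * math.log(r)   # log of c = r^(m r^(-m))
    c = math.exp(log_c)                   # may underflow to 0.0, where h(c) = 1
    return h(c) - h(r) - r * math.log(r) * (log_c - math.log(r))

def r_m(m, lo=0.3, hi=0.99, iters=100):
    """Bisection for the root of gap(., m), assuming gap(lo) < 0 < gap(hi)."""
    for _ in range(iters):
        mid = (lo + hi) / 2
        if gap(mid, m) < 0:
            lo = mid
        else:
            hi = mid
    return (lo + hi) / 2

r3 = r_m(3)
r10 = r_m(10)
```

Working with $\log c$ directly avoids computing the tiny quantity $c = r^{mr^{-m}}$ before taking its logarithm, which matters for small $r$ or large $m$.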


Proof. Let $r \ge r_m$. Let $W$ be a minimizer for $\mathrm{LT}(H,r)$. By Lemma 5.2, $W \ge r^{mr^{-m}}$ almost everywhere. Thus it follows by Fact 5.5 below (and it can be checked that $r_m \ge 1/e$) that

$$h(W) \ge h(r) + r\log r(\log W - \log r) \quad \text{a.e.} \tag{5.3}$$

Taking expectation of both sides and using $\mathbb{E}[\log W] \le \log r$ from Lemma 5.3 (note that $\log r < 0$), we obtain $\mathbb{E}[h(W)] \ge h(r)$, as desired. To see that $W = r$ is unique, suppose $W$ is another minimizer of $\mathrm{LT}(H,r)$. Equality must hold everywhere in the argument. In particular, equality must hold in (5.3) almost everywhere, which easily implies that $W = r$ (for the critical case $r = r_m$, $W$ might also take the value $r^{mr^{-m}}$, but only on a set of measure zero since $\mathbb{E}[h(W)] = h(r)$). ■

Fact 5.5. If

$$h(x) \ge h(r) + rh'(r)(\log x - \log r) \tag{5.4}$$

holds for some $(x,r) = (x_0,r_0)$, with $0 < x_0 \le r_0 \le 1$ and $r_0 \in [1/e,1]$, then it holds for all $(x,r) \in [x_0,1] \times [r_0,1]$.

Proof. The partial derivative of the RHS of (5.4) with respect to $r$ is $-(1+\log r)(\log r - \log x)$, which is at most zero as long as $x \le r$ and $r \ge 1/e$. This shows that if (5.4) holds for some $(x,r) = (x_0,r_0)$ then it automatically holds for $(x,r) = (x_0,r)$ for all $r \in [r_0,1]$.

Let us now fix $r$. Let

$$f(x) := f_r(x) := h(x) - h(r) - rh'(r)(\log x - \log r). \tag{5.5}$$

Some examples of $f_r$ are plotted in Figure 5. We have

$$f'(x) = \log x - \frac{r\log r}{x}$$

and

$$f''(x) = \frac{1}{x} + \frac{r\log r}{x^2} = \frac{x + r\log r}{x^2}.$$

So $f''(x) \le 0$ for $x \le -r\log r$ and $f''(x) \ge 0$ for $x \ge -r\log r$. Note also that $f(r) = f'(r) = 0$, and $-r\log r \le r$ as long as $r \ge 1/e$. By analyzing the convexity of $f$ (see Figure 5), we see that $f(x_0) \ge 0$ implies $f(x) \ge 0$ for all $x \in [x_0,1]$. ■

Figure 5. Plot of $f_r$ from (5.5) for various values of $r$.

5.2. Symmetry breaking. The proof of the second part of Theorem 2.7 is easy. One can fine-tune the bounds as in Section 4.2, though we omit the analysis here.

Proposition 5.6. Let $H$ be a nonbipartite graph. Then $\mathrm{LT}(H,r) < h(r)$ for all $r \le 0.186$.

Proof. The graphon $W = \mathrm{BIP}_{0,1}$ satisfies $t(H,W) = 0$, and $\mathbb{E}[h(W)] = \frac12 h(0) = \frac12$, which is strictly less than $h(r)$ for all $r \le 0.186$. ■
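The constant $0.186$ is a rounding of the solution of $h(r) = \frac12$, since $\mathbb{E}[h(\mathrm{BIP}_{0,1})] = \frac12 h(0) + \frac12 h(1) = \frac12$. A short bisection (the bracket and variable names are arbitrary choices for this illustration) recovers it:

```python
import math

def h(x):
    return x * math.log(x) - x + 1

# h is strictly decreasing on (0, 1), tending to 1 at 0 and equal to 0 at 1,
# so h(r) = 1/2 has exactly one solution there; locate it by bisection.
lo, hi = 0.01, 0.99
for _ in range(100):
    mid = (lo + hi) / 2
    if h(mid) > 0.5:
        lo = mid
    else:
        hi = mid
r_half = (lo + hi) / 2
```

Every $r$ below `r_half` satisfies $h(r) > \frac12$, which is the inequality used in the proof.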


6. Open problems

We conclude with some open problems concerning the variational problem for upper and lower 
tails. 

• Upper tail phase diagram. Determine the upper tail replica symmetry phase diagram 
for non-regular H. 

• Lower tail phase diagram. Determine the lower tail replica symmetry phase diagram for $K_3$, and more generally for any non-bipartite graph $H$. In particular, determine $r_H^*$ from Conjecture 2.4. For a bipartite graph $H$, determine whether there is replica symmetry everywhere (Conjecture 2.3).

• Solution in the symmetry breaking phase. Solve the variational problem UT or LT 
at any non-trivial point where the constant graphon is not a minimizer. 

Acknowledgments. I would like to thank Eyal Lubetzky for helpful discussions. 